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METHODS FOR OPTIMIZING 
MULTICOMPONENT FORMULATIONS 

FIELD OF THE INVENTION 

The present invention relates to methods for identifying 
optimal formulations of multicomponent systems. An optimal formulation 
is a particular combination of two or more component materials mixed in a 
specific ratio such that the formulation optimally possesses a desired set of 
properties. 

BACKGROUND 

Formulations comprising two or more materials are commonly 
encountered in the chemical and pharmaceutical fields and find use in a 
wide range of industries, including, for example, the biotechnological, 
medical, petroleum and paint industries. These multicomponent 
formulations often require optimization, namely, that a particular set of 
ingredients be combined in a preferred specific ratio in order to achieve or 
maximize a desired set of properties. 

One particularly important example of a multicomponent formulation 
in the pharmaceutical field is that of drug carriers, which may, for example, 
be composed of polymers, proteins or lipids associated to form discrete 
assemblies (e.g., liposomes, nanoparticles, emulsions or colloids). Where 
targeted drug delivery is the goal, it is typically desired that the drug carriers 
be designed to preferentially target certain tissues and cells (e.g., tumors or 
cancerous organs) and to avoid others (e.g., normal tissues and cells). Thus, 
it is often necessary to determine the particular carrier formulation which 
reaches a target with the greatest efficiency and selectivity. Such a goal can 
be accomplished by formulation optimization. More generally, formulation 
optimization is typically required whenever it is desirable to achieve or 
maximize a specific set of properties for a multicomponent system. 

The ability to obtain an optimal set of properties for a 
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multicomponent formulation is often highly dependent on the chemical 
composition and processing methods used to make the formulation. For 
example, the identity and relative amounts (i.e., ratios) of ingredients can 
dramatically affect the properties (e.g., chemical, physical or electrical) of 
5 the resultant formulation. For these reasons, when dealing with 

multicomponent formulations, it is usually necessary to evaluate a very 
large number of candidate formulations to find a formulation which 
possesses a desired set of properties. 

Presently, formulation optimization is often accomplished by 
10 individually and sequentially testing particular combinations and ratios of 

component materials by trial and error. Unfortunately, this type of 
sequential empirical evaluation of candidate formulations is time- 
consuming and inefficient, especially where several components are 
combined in various ratios. Determining an optimal formulation may 
1 5 require hundreds or even thousands (or more) of experiments, even where a 

skilled worker dismisses before testing a large number of candidate 
formulations which are unlikely to be successful. Sequential testing is 
especially frustrating or impractical when the experiments are lengthy or 
costly to perform. Moreover, individual formulation testing is usually 
20 infeasible in the case of drug formulations, especially where the drug is 

being tested in vivo (i.e., in living animals). For example, if drug targeting 
to a tumor site within an animal subject is a goal, evaluation of each 
candidate formulation would typically require excision of a tumor from a 
separate animal in order to determine whether the particular formulation 
25 successfully reached the tumor site in an effective amount. For reasons 

such as these, individual testing of candidate formulations, where evaluation 
of a large number of compositions is contemplated, is often impracticable. 
Thus, it is desired to be able to evaluate a plurality of multicomponent 
formulations simultaneously, thereby avoiding the need to individually 
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and/or sequentially test candidate formulations. 

Unfortunately, standard techniques for simultaneous analysis of a 
large number of formulations are frequently inadequate because it is often 
impossible to accurately determine the identity of a formulation possessing 

5 a desired set of properties (i.e., an optimal formulation) since such a 

determination normally requires extraction or separation from a complex 
mixture. This is especially true where only very small quantities of an 
optimal formulation are recoverable, since standard chemical analysis may 
lack either adequate sensitivity or accuracy, or both. Lack of analytical 

10 sensitivity or accuracy is particularly problematic where only a small initial 

amount of the candidate formulation is available or where the component 
ratios are varied in small increments between individual candidate 
formulations. Moreover, chemical analysis will likely be unable to 
differentiate or discriminate between similar (but distinct) formulations. In 

15 addition, in a case where a formulation contains naturally occurring lipids or 

other biomolecular components, such components are probably 
indistinguishable from the biological milieu in vivo. 

Determination of an optimal formulation is also particularly difficult 
where the composition is degraded or changed prior to expressing the 

20 desired set of properties (e.g., in the course of reaching a target site in vivo). 

In this case, chemical analysis of the formulation which reaches the target 
site would probably yield an inaccurate and meaningless result; analysis, if 
possible, would reflect the degraded or changed composition of the 
formulation at its final site, rather than the initial composition of the 

25 formulation. For example, testing may be performed on one thousand 

different initial candidate formulations, wherein two particular formulations 
differ only with respect to one ingredient: Formulation 1 comprises 10% 
ingredient X and Formulation 2 comprises 5% ingredient X. Assume that 
an initial composition containing 10% X is essential for target specificity 
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and, consequently, that of the 1,000 different formulations injected, only 
Formulation 1 reaches the desired target (i.e., Formulation 1 is the optimal 
formulation). Assume also that in the course of reaching the target site, 
Formulation 1 is either stripped of half of its amount of ingredient X or that 
5 half of its ingredient X is metabolized or degraded. In this scenario, 

chemical analysis of the formulation reaching the target site would 
incorrectly reveal the optimal formulation as one which contains 
approximately 5% ingredient X. Therefore, in this example, although 
Formulation 1 is the true optimal formulation, chemical analysis would 

10 inaccurately identify Formulation 2 as the optimal formulation. A method 

capable of identifying the initial composition of an optimal formulation is 
needed where the formulation may undergo a physical or chemical change 
prior to being evaluated for expression of a given set of properties. 

Accordingly, it has been a desideratum to provide a method to 

15 determine optimal formulations of multicomponent systems whereby a 

plurality of candidate formulations is evaluated simultaneously with respect 
to expression of a desired set of properties. 



SUMMARY OF THE INVENTION 

20 The present invention relates to methods for optimizing 

multicomponent formulations, including, for example, drug carriers. 
Multicomponent formulations for use in the present invention preferably 
comprise discrete assemblies such as liposomes, emulsions, nanoparticles or 
colloids, which remain substantially discrete during the optimization 

25 process. 

Formulation optimization may be achieved by simultaneously testing 
a large number of candidate formulations and evaluating them with respect 
to certain properties. Optimal formulations are separated from other 
candidate formulations by the presence of a desired set of properties, such 
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as targeting to tumor tissue or a site of infection, or a favorable physical 
distribution (e.g., a formulation which is in the form of a clear solution as 
opposed to a precipitate or a formulation which has a particular optical 
absorbance). An optimal formulation is identified as to its components by 
5 an associated marker which contains a signal sequence, or other chemical 

information, that is uniquely correlated with the initial composition of the 
optimal formulation. 

The present invention also permits, where there exist multiple 
optimal formulations, comparative analysis of the quantities and 
10 effectiveness of each optimal formulation within the set of optimal 

formulations. Determination of the quantity of marker associated with each 
of the optimal formulations (e.g., by signal intensity) can provide a 
histogram that is representative of the relative individual contributions of 
each formulation. The advantage of the present invention in this respect is 
1 5 the ability to discriminate between individual formulations in the resulting 

mixture, which is normally impossible using conventional analytical 
techniques that, at best, can provide only a weighted average of all 
formulations in the extracted material. 

In practice of the present invention, each candidate formulation is 
20 associated with one or more unique markers that is specific to the particular 

candidate formulation and whose identity is correlated to the initial 
composition of the formulation. Association of a marker with the discrete 
assemblies of its corresponding formulation must be sufficiently strong and 
stable so as to prevent cross-contamination of markers between different 
25 candidate formulations and appreciable loss of marker before the identity of 

the optimum formulation is determined. Each marker is associated with a 
candidate formulation by, for example, encapsulation of the marker within 
or stable attachment (chemically or physically) to the discrete assemblies of 
the formulation. After marking, the candidate formulations are evaluated 



BNSDOCID: <WO 9720076A 1_l_> 



WO 97/20076 PCT/US96/19030 

6 

with respect to exhibition or performance of a desired set of properties. 
Evaluation in the case of pharmaceutical products, particularly therapeutic 
or diagnostic agents, may comprise administration of the candidate 
formulation to a collection of live cells, either in vitro or in vivo. In the case 
5 of in vivo administration, the candidate formulations may be administered to 

an animal body intravenously or by any other appropriate administration 
route, including intramuscular, subcutaneous, percutaneous, oral ingestion 
and topical administration. 

The information correlating the composition of a given formulation 

10 and its associated marker may be encoded within the unique marker itself or 

may be recorded and cross-referenced in a separate database. This database 
comprises a record of the compositional data for each candidate formulation 
and the identity of its corresponding marker. The database may be 
recorded, stored and read using conventional data storage techniques, 

15 including physical, magnetic, electronic and optical devices (e.g., magnetic 

tapes and optical disks). Once the identity of a marker associated with a 
given formulation has been determined, the compositional data for that 
formulation may be found by accessing the appropriate database address 
which cross-references the identities of the marker and its corresponding 

20 formulation. The particular format or structure of the database, however, is 

not important to practice of the present invention. Preparation of an 
appropriate database is well understood by those skilled in the art. 

The marker of the present invention is preferably capable of being 
accurately analyzed with substantially higher sensitivity and discrimination 

25 than the components of the formulation. The increased sensitivity is 

derived from, for example, the ability of the marker to be amplified or. 
copied to yield a greater amount for analysis, or by the use of markers 
which can be accurately identified using more sensitive and specialized 
analytical techniques than those existing for the formulation ingredients 
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themselves. 

In a preferred embodiment of the invention, the marker is a synthetic 
nucleic acid (DNA or RNA) oligonucleotide between about 12 and 100 
nucleotides in length and having a sequence corresponding to a specific 
composition and component ratio of a candidate formulation. A variable 
region (or regions) contained within the oligonucleotide marker provides a 
sufficient number of oligonucleotide combinations to permit their use as 
unique markers to the candidate formulations. For example, using the four 
common nucleic acid bases for synthesis of the oligonucleotide marker 
yields at least 4 X possible unique combinations, where x is the number of 
bases in the variable region(s). Thus, where x=10, there are at least 
4 10 = 1,048, 576 unique oligonucleotide markers available. The length of the 
variable region(s) may be adjusted to provide an array of at least as many 
oligonucleotide combinations as there are candidate formulations to be 
evaluated. Association of the oligonucleotide marker with a candidate 
formulation may be obtained by encapsulation of the marker within or 
attachment of the marker to the individual assemblies or particles of the 
formulation. 

The nucleic acid marker is preferably capable of amplification by the 
polymerase chain reaction (PCR) or by any of various other nucleic acid 
multiplication schemes known in the art. In a preferred embodiment, each 
oligonucleotide is flanked by defined 5' and 3' fixed regions that allow 
primer hybridization for Klenow extension, a primer annealing site for 
cDNA synthesis, PCR amplification and efficient T7 RNA polymerase 
transcription. PCR and other amplification techniques are described in 
Davis et al. (1994) Basic Methods in Molecular Biology . 2nd edition 
(Appleton & Lange, Norwalk, CT), the disclosure of which is incorporated 
herein by reference. 

In another preferred embodiment, the marker comprises an 
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electrophone moiety which is associated with the discrete assemblies of a 
given candidate formulation. In a preferred embodiment, the electrophone 
marker may comprise a variable-length alkyl chain and a functionalized 
polyhaloarene moiety. A set of distinctly resolvable electrophoric markers 
may be prepared by varying the length of the alkyl chain and the halogen 
substitution of the aromatic moiety. The absence or presence of an 
electrophoric marker essentially constitutes a binary code which can be used 
to record the initial composition of the candidate formulation with which it 
is associated. 

As with the oligonucleotide markers, association of the electrophoric 
marker may be obtained by encapsulation of the marker within or 
attachment of the marker to the discrete assemblies of a candidate 
formulation. Alternatively, however, a component intended to be included 
in the formulation may itself be chemically modified to directly incorporate 
the electrophoric marker. In the case where the electrophoric marker is 
incorporated directly into the formulation, a small amount of a component 
suitable for inclusion in the formulation is modified (prior to preparation of 
the candidate formulations) to incorporate a linker to which is attached the 
electrophoric marker(s) or any other suitable marker. In either case, 
identification of the electrophoric marker may be accomplished using 
various techniques known to those skilled in the art for analyzing minute 
quantities of organic molecules, including gas chromatography (GC) and 
mass spectrometry (MS). 

For example, identification of the electrophoric marker may be 
performed preferably by using capillary gas chromatography, or most 
preferably, because of its detection sensitivity, electron capture gas 
chromatography (ECGC). In the case where the electrophoric marker is 
incorporated directly into the formulation by chemical modification of a 
component, it is preferred that the linker contains a photochemically labile 
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bond such that the electrophone marker is cleaved specifically upon 
irradiation, thereby releasing the marker element in this case a haloarene 
moiety, for facile analysis. 

Alternatively, a similar set of haloarenes may be used to establish 
5 markers which may be identified by mass spectrometry. In the latter case, 

uniquely resolvable molecular mass signals may be obtained by varying the 
halogen substitution of the arene moiety or by incorporating appropriate 
isotopic substitutions. Where mass spectrometry is utilized for detection, 
the techniques of matrix-assisted laser desorption/ionization (MALDI) or 
10 electrospray ionization (ESI) may be employed to generate molecular ions 

of the markers. One skilled in the art is familiar with the use of these 
techniques in conjunction with mass spectrometry. 

Aromatic halocarbons similar to the electrophone markers of the 
present invention have been used in other applications as molecular tags for 
15 the indexing of synthetic chemical combinatorial libraries. See, Gallop et 

al. (1994) J. Med. Chem. 37:1233-1251 and Ohlmeyer (1993) Proc. Natl. 
Acad. Sci. USA 90:10922-10926 . 

Due to the sensitivity with which the markers of the present 
invention can be identified, only very small quantities of each formulation 
20 are needed for optimization. Thus, the present invention is especially useful 
where the formulation is expensive or, in the case of in vivo evaluation, 
where small amounts of tissue are available and/or where the formulation is 
toxic at high dosages. 

Other features and advantages of the present invention will be 
25 apparent upon review of the detailed description of the preferred 

embodiment, the drawings and the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

It is to be understood that the following drawings are designed for 
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the purpose of illustration only and are not intended to define or limit the 
scope of the invention. 

FIGURE 1 is a flowchart illustrating one of the preferred 
embodiments of the method of the present invention, as described in the 
5 Detailed Description below. 

FIGURE 2 depicts an example of the structure of an electrophone 
marker, including a photocleavable linker. A set of chromatographically 
resolvable markers may be generated by varying n (the length of the alkyl 
chain) and Ar (the halogen substitution of the arene moiety). Examples of 
10 Ar groups suitable for use in the present invention are also shown. 

DETAILED DESCRIPTION 

Multicomponent formulation means a preparation consisting of two 
or more chemical components combined in a specific ratio. A 

15 multicomponent formulation for use in the present invention preferably 

comprises discrete assemblies (e.g., microparticulate preparations such as 
liposomes, emulsions or colloids) which may be strongly associated with 
one or more unique identifying markers. It is further preferred that the 
association between each marker and its formulation is maintained 

20 throughout the process in which formulations are evaluated for a desired set 

of properties, and also that the assemblies are relatively stable physically 
and remain essentially discrete between different individual formulations 
(i.e., little or no chemical mixing of components between different 
formulations). 

25 A candidate formulation refers to one of a plurality of 

multicomponent formulations that is evaluated substantially simultaneously 
with respect to a desired set of properties in order to determine an optimal 
formulation, i.e., a candidate formulation which possesses or exhibits the 
desired set of properties. In some cases, the desired set of properties may 
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comprise the lack of a given response. For example, it may be desired that a 
given diagnostic or imaging contrast agent not be taken up by (i.e., is 
excluded from) a particular organ when administered. 

An optimal formulation is a candidate formulation which has been 
5 identified as possessing a desired set of properties upon evaluation. Often, 

there may be more than one optimal formulation, depending on the 
particular set of properties chosen. When multiple optimal formulations are 
determined to exist, the optimal formulations will likely possess similar 
chemical or physical attributes and constitute a family or a small number of 

10 families of related formulations. Formulation optimization refers to 

determination of the initial composition (e.g., component identities and their 
ratios) of an optimal formulation, by evaluating a plurality of candidate 
formulations substantially simultaneously with respect to the desired set of 
properties. The set of properties may consist of one or more properties 

15 (e.g., physical, chemical, electrical) which permits an optimal formulation 

to be distinguished, and preferably separated, from other candidate 
formulations. Physical separation of an optimal formulation from other 
candidate formulations facilitates identification of its associated marker(s) 
and, consequently, its initial composition. For example, where 

20 multicomponent formulations comprising microparticulates are optimized 

with respect to surface charge, evaluation of candidate formulations (and 
separation of an optimal formulation) may be accomplished by application 
of an electric field to a mixture of the candidate formulations. 

A marker refers to a component of each formulation which contains 

25 information that is capable of identifying each individual formulation. A 

marker can contain a signal sequence or other chemical information that is 
uniquely correlated with the initial composition of the optimal formulation. 
Examples of suitable markers include, but are not limited to 
oligonucleotides, electrophoric markers and the like. 

BNSDOC1D: <WO 9720076A1 J_> 
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Desired properties of multicomponent formulations in the present 
invention may include, among others, chemical, physical, electrical, and 
pharmacological properties. For example, in pharmaceutical or medical 
applications, specific targeting of a therapeutic or diagnostic agent to a 
5 particular site in vivo is often desirable. In the case of drug carrier 

formulations, liposomes (and certain other lipid-based carriers such as 
emulsions) are known to be physiologically compatible and effective as 
delivery systems for a broad range of drugs and biomolecular agents, and 
may be used to encapsulate (or incorporate) both aqueous-soluble and lipid- 
10 soluble materials. For example, United States Patent No. 5,019.369, issued 

5/28/91 entitled "Method of Targeting Tumors in Humans," teaches the 
targeting of diagnostic and chemotherapeutic agents to tumors in the body 
of a patient by intravenous administration of the liposomally encapsulated 
agent. Various methods for associating or incorporating molecules into 
15 liposomes and other lipid-based carriers are well known in the art. In 

addition, United States Patent No. 5,435,989, issued 7/25/95 entitled 
"Method of Targeting a Specific Location in a Body," and United States 
Patent No. 5,441,745, issued 8/15/95 entitled "Method of Delivering 
Miceilular Particles Encapsulating Chemotherapeutic Agents to Tumors in a 
20 Body," describe the use of liposomes to deliver anticancer agents 

preferentially to a tumor site. The disclosures of the aforementioned United 
States Patents are incorporated herein by reference. 

According to the present invention, methods for optimizing 
multicomponent formulations are described. Specifically, a plurality of 
25 candidate formulations is prepared, each formulation differing with respect 

to chemical composition (i.e., identity and/or ratio of ingredients). The 
present invention is particularly advantageous over existing techniques (e.g., 
sequential trial and error evaluation) where the number of candidate 
formulations exceeds about 10. The candidate formulations may be 
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prepared by random mixing of ingredients (which additionally requires 
chemical analysis prior to testing, in order to determine the composition of 
each formulation) or, preferably, by deliberate choice of formulation 
compositions. The appropriate volume for each individual candidate 
formulation may be readily determined by one skilled in the art, so as to 
provide a sufficient concentration of marker for evaluation and for 
identification by the particular analytical instrumentation utilized, in 
addition to consideration of other factors such as formulation cost. 

Since only very small amounts of each candidate formulation are 
necessary for evaluation, the present invention makes it more feasible to test 
a very large number of candidate formulations, particularly because the 
combined total amount or volume of the formulations is manageable. 
Moreover, in the case of oligonucleotide markers, PCR amplification or 
alternative multiplication techniques permit identification of the 
oligonucleotide marker sequence where the amount recovered of an optimal 
formulation is too small for standard chemical analysis. Hence, the 
oligonucleotide marker is capable of being accurately identified with higher 
sensitivity than the formulation itself, which typically cannot be similarly 
multiplied or amplified. PCR amplification requires, however, that a PCR 
primer or other initiation sequence be incorporated into the oligonucleotide 
marker prior to association with a candidate formulation. 

Oligonucleotides for use as markers in the present invention may be 
synthesized using any of the various techniques presently known in the art 
or subsequently developed. See, Gait, ed. (1984) Oligonucleotide 
Synthesis: A Practical Approach (IRL Press. Oxford) the disclosure of 
which is incorporated herein by reference. For example, oligonucleotides 
may be synthesized using the solid-phase method. Commonly used solid 
supports for this method of oligonucleotide synthesis include polystyrene, 
silica gel, glass beads, polyamide and cellulose paper. In a preferred 
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embodiment of the invention, the marker is a synthetic nucleic acid (DNA 
or RNA) oligonucleotide between about 12 and 100 nucleotides in length 
and having a sequence corresponding to a specific composition and 
component ratio of a candidate formulation. A variable region (or regions) 
5 contained within the oligonucleotide marker provides a sufficient number of 

oligonucleotide combinations to permit their use as unique markers to the 
candidate formulations. For example, using the four common nucleic acid 
bases for synthesis of the oligonucleotide marker yields at least 4 X possible 
unique combinations, where x is the number of bases in the variable 
10 region(s). Thus, where x=10, there are at least 4 ,0 =1, 048,576 unique 

oligonucleotide markers available. The length of the variable region(s) may 
be adjusted to provide an array of at least as many oligonucleotide 
combinations as there are candidate formulations to be evaluated. 
Association of the oligonucleotide marker with a candidate formulation may 
15 be obtained by encapsulation of the marker within or attachment of the 

marker to the individual assemblies or particles of the formulation. 

Following synthesis, association of an oligonucleotide marker with 
the discrete assemblies of a given candidate formulation may be 
accomplished by a variety of methods. For example, in the case of 
20 liposome formulations, an oligonucleotide marker may be passively 

entrapped within the aqueous core of the liposome according to any of the 
various techniques known in the art for entrapping nucleic acids. 
Alternatively, an oligonucleotide marker may be actively loaded into 
preformed liposomes, according to methods disclosed in United States 
25 Patent No. 5,227,170, issued 7/13/93 entitled "Encapsulation Process," or 

they may be incorporated into lipid constructs and lipid-based carriers, 
according to methods disclosed in United States Patent Application Serial 
No. 08/434,465, filed 5/4/95 entitled "Nucleic Acid Ligand Complexes/' 
both of which are incorporated herein by reference. 
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Alternatively, an electrophone marker may be associated with a 
candidate formulation. In the case of lipid-based multicomponent 
formulations, a lipid component may be chemically modified to incorporate 
the electrophone marker directly by, for example, attachment through a 
linker. An example of the basic structure of an electrophone marker- 
modified lipid is: 

lipid-linker-OCH 2 -(CH 2 ) n -CH 2 -0-Ar, 
where n represents the number of carbon units in the variable-length alkyl 
chain and Ar is a haloarene moiety and may comprise, for example, C 6 C1 5 , 
C 6 H 2 C1 3 or C 6 H 2 C1 2 F. The lipid portion may be composed of any 
appropriate lipid compound, and is preferably, in the case of drug carriers 
for in vivo applications, pharmaceutical ly acceptable. Most preferably, the 
lipid is distearoyl phosphatidylethanolamine (DSPE). The linker can be any 
Afunctional agent which couples the lipid to the alkyl haloarene moiety. A 
set of resolvable markers may be generated by varying n and Ar. The value 
of n may, for example, range from 3 to 12. 

The set of modified lipids is used to tag a candidate formulation by 
including only certain members of the set in a given candidate formulation 
at a level between about 0.0001 and 10% (most preferably between about 
0.01 and 1%) by weight each. If, as in the marker set described above, there 
are potentially 30 members of the electrophone marker set (i.e., n has 10 
possible values, Ar may be one of three haloarene species), then 
accordingly, 2 30 = 1,073,741,824 distinct formulations can be differentiated 
using this method. However, depending on the particular analytical 
technique employed, not all 30 members of the set may be distinctly 
resolvable. One skilled in the art though may readily determine a 
sufficiently large marker set by selecting those markers which can be 
adequately resolved by the given analytical technique. Smaller marker sets 
comprising fewer formulations may be utilized where evaluating fewer 
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candidate formulations. The concentration of the modified lipid for 
inclusion in a given formulation is governed on the lower end only by the 
detection limit of the identification technique; the upper concentration limit 
is set according to that amount which, when incorporated in a formulation, 
5 alters the physical or chemical properties of the formulation. 

Various methods can be used to identify the electrophone marker, 
including gas chromatography and mass spectrometry. Electron capture gas 
chromatography (ECGC) is particularly preferred for analysis of the 
electrophone markers because it has excellent sensitivity and 

10 discrimination, allowing the electrophone marker to be accurately identified 

at significantly lower levels than the formulation with which it is associated. 

In a preferred embodiment, the electrophone marker is attached to 
the component via a linker and the linker contains a photochemically labile 
bond such that the electrophoric marker is cleaved specifically upon 

15 irradiation and thereby released for facile analysis. One example of a 

photolabile moiety for inclusion in the linker is o-nitrobenzyicarbonate. 

Following association of the markers (e.g., oligonucleotide or 
electrophoric) with their corresponding candidate formulations, the 
candidate formulations may, in the case of drug carrier optimization, be 

20 administered intravenously (i.v.) or by another administration route (e.g., 

topically, orally, subcutaneously) into an animal for evaluation with respect 
to expression of a desired set of properties. The candidate formulations may 
also be administered in vitro to cells in culture for evaluation. 

In one embodiment of the invention, drug carrier formulations are 

25 optimized with respect to their ability to enter the cytoplasm of a cell, and 

further yet, the cell nucleus. For example, delivery of DNA to the nuclei of 
cells is necessary for effective gene therapy. Liposomes are useful as 
carriers for cellular drug delivery formulations utilizing therapeutic DNA 
and RNA oligonucleotides. Naked or otherwise unprotected 
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oligonucleotides in their phosphodiester form are quickly degraded in body 
fluids by intracellular and extracellular enzymes, such as exonucleases and 
endonucleases, before they reach the target cell. Encapsulation within 
liposomes protects oligonucleotides from enzymatic degradation and may 
5 increase cellular uptake and delivery as a result of phagocytosis of the 

liposomes. Cationic liposomes and lipid constructs comprising 
aminomannose-derivatized cholesterol are particularly effective for 
introducing nucleic acids into cells, as described in United States Patent 
Application Serial No. 08/386,577, filed 2/10/95 entitled "Lipid Constructs 
10 for Cytoplasmic Delivery of Agents," the disclosure of which is 

incorporated herein by reference. In the case of therapeutic nucleic acid 
delivery, the formulation typically comprises, in addition to the therapeutic 
nucleic acid, a distinct identification marker, which may itself be an 
oligonucleotide. In cases where delivery is being evaluated, the marker can 
15 serve as the surrogate drug as well. 

For formula optimization for gene therapy applications, each 
candidate formulation optionally incorporates a nucleic acid sequence 
which is capable of producing expression of a signal in target cells in a 
characteristic and identifiable manner. For example, the formulations may 
20 be prepared to incorporate a piasmid which is known to encode for a protein 

that produces a fluorescent signal at a characteristic wavelength when 
irradiated. See, Chalfie et al (1994) Science 263:802-805). Accordingly, 
detection of fluorescence attributable to such a protein indicates effective 
delivery of a given candidate formulation to the nuclei of the target cells, 
25 and consequently, identification of an optimal formulation for gene therapy 

applications. 

Where oligonucleotide markers are used, short strands of DNA or 
RNA oligonucleotides (less than approximately 100 nucleotides in length) 
are synthesized with known sequences. In a preferred embodiment, each 
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oligonucleotide is flanked by defined 5 r and 3' fixed regions that allow 
primer hybridization for Klenow extension, a primer annealing site for 
cDNA synthesis, PCR amplification and efficient T7 RNA polymerase 
transcription. Most preferably, the various oligonucleotide markers are then 
associated with the discrete assemblies of the candidate formulations and a 
database, cross-referencing the compositional data of each candidate 
formulation with the identity of its associated marker, is established. The 
marker may be associated with a candidate formulation by encapsulation 
within or attachment (by either physical or chemical means) to the discrete 
components of the candidate formulation. In the case of liposome 
formulations intended for in vivo evaluation, the oligonucleotide marker is 
preferably encapsulated within the aqueous interior for protection from 
degradative enzymes. 

The candidate formulations, each tagged with a unique marker, are 
subsequently evaluated with respect to expression of a desired set of 
properties. For example, drug carriers incorporating oligonucleotide or 
electrophone markers are injected i.v. into a tumor-containing animal and 
evaluated to determine the extent of drug carrier targeting to the tumor site. 
After a sufficient time, the tumor may be excised from the animal and 
analyzed for the presence of markers. Recovered oligonucleotide markers 
may be amplified using, e.g., PCR technology, and sequenced. Methods of 
sequencing DNA or RNA strands of this length are well known to those 
skilled in the art. Alternatively, electrophoric markers may be identified 
using mass spectrometry or gas chromatography (e.g., capillary GC or 
ECGC). The marker identity, once ascertained, is used to determine the 
corresponding initial formulation composition either directly (where the 
compositional data of a formulation is encoded within the marker) or by 
reference to a separate database. The formulation(s) recovered in the 
greatest amount is (are) typically considered optimal formulation(s). In 
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some instances, however, optimal formulations may be those formulations 
which are recovered in the least amount following evaluation. 

Where the number of candidate formulations for evaluation exceeds 
the practical limit for a single experiment, as determined by, for example, 
the total maximum volume which can be evaluated at one time, the process 
may be repeated successively to the extent necessary. One skilled in the art 
will readily appreciate the various embodiments of the present invention 
and considerable flexibility can be afforded the practical implementation of 
the claimed methods. 

Where liposomes or lipid constructs constitute the formulation to be 
optimized, they may be constructed using a variety of amphiphilic 
molecules, which consist of polar (hydrophilic) and non-polar (lipophilic) 
portions. Examples of hydrophilic portions include phosphato, 
glycerylphosphato, carboxy, sulfato, amino, hydroxy, choline and other 
polar groups. Examples of non-polar groups are saturated or unsaturated 
hydrocarbons such as alkyl, acyl, alkenyl or other lipid groups. The chain 
length of each of these non-polar groups may additionally be varied to alter 
the physical and pharmacological properties of the resultant formulations. 
Also, sterols (e.g., cholesterol), surface-modifying groups (e.g., 
carbohydrates), and other pharmaceutical!}' acceptable adjuvants, including 
anti-oxidants like alpha-tocopherol, may be included in the formulation. 
Other liposome or lipid-based drug carrier ingredients suitable for particular 
applications are known to workers skilled in the art. See, e.g., New, ed. 
(1992) Liposomes: A Practical Approach (IRL Press, Oxford). Similarly, 
where multicomponent formulations other than lipid-based assemblies are 
contemplated for optimization, appropriate components may be chosen by 
those skilled in the art. 

Referring generally to FIG. 1, a flowchart summarizing one preferred 
embodiment of the present invention is depicted. This flowchart outlines 



97/20076 , PCT/US96/19030 

20 

the method of the present invention for optimizing drug carrier formulations 
for in vivo targeting in a body. In this embodiment, an animal tumor cell is 
targeted using various drug carrier formulations. 

Referring in detail to FIG. 1, solvents containing one or more 
components are conducted through a series of input tubes "A." The solvents 
from input channels "A" are mixed in specific proportions in chamber "B" 
to form a first candidate formulation, followed successively (in separate 
mixing steps) by the preparation of other candidate formulations. The 
candidate formulation in each given step (each formulation containing, for 
example, 100 ^L) is then individually tagged with a unique marker (e.g., a 
synthetic oligonucleotide or an electrophoric moiety) supplied by "C." Each 
marker is programmed to correspond to a specific component ratio in each 
successive candidate formulation, such that each successive input stream 
from M C" comprises a solvent and one or more markers, synchronized with 
the discrete 100 ]nL zones of each candidate formulation. The procedure for 
associating a marker with a candidate formulation will depend on the nature 
of the association (e.g., encapsulation, external binding, inclusion as an 
integral component); appropriate procedures for associating markers with 
candidate formulations may be determined by one skilled in the art. In the 
case where the marker is an oligonucleotide encapsulated within liposomes, 
chamber "B" contains machinery necessary to prepare lipid vesicles (e.g., 
homogenizer). Once the formulations are tagged with the markers, it is 
unimportant whether the individual candidate formulations are maintained 
as discrete zone packets or are combined (i.e., admixed). 

Optionally, the candidate formulations may be subjected to size 
selection before evaluation. For example, the candidate formulations may 
be passed through device M D M which separates and discharges components 
with sizes below a minimum cutoff value, and may then be conducted 
through filter "E" which passes components with sizes below a maximum 
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cutoff value. 

The marked formulations may be eventually collected in vessel "F" 
and subsequently administered (e.g., by injection) into a test animal "G" 
having a tumor or tumors "H." Following an appropriate treatment time, a 
5 tumor is excised from the animal and is treated to release individual cells 

into solution. The released cells are then input into device "I" which 
separates cells from supernatant based on a signal associated with the cells. 
For example, "I" may comprise a fluorescence-activated cell sorter (FACS) 
which separates cells that express a fluorescence signal. The signal- 

10 expressing cells are then passed through " J, M which analyzes and identifies 

the markers associated with the tumor cells (i.e., optimal formulation 
markers). In the case where the marker is an oligonucleotide, "J" may 
comprise instrumentation for amplifying the markers (e.g., by PCR) and 
sequencing the nucleic acids. Where the marker is an electrophone 

15 haloarene, "J" may comprise, for example, mass spectrometric or gas 

. chromatographic instrumentation. With the identity of the markers known, 
the initial compositions of the optimal formulations can be determined, for 
example, from the marker directly or by reference to database "K" which 
contains information cross-referencing the identity of a marker with the 

20 composition of the candidate formulation with which it was originally 

associated in chamber f, B. n 

EXAMPLE 1 

FORMULATION OPTIMIZATION FOR IN VIVO TARGETING 
25 OF LTPID-BASED DRUG CARRIERS USING OLIGONUCLEOTIDE 

MARKERS 

Lipid-based drug carrier formulations are prepared by combining, 
four component ingredients and optimized for their ability to target specific 
tissue sites in vivo. 
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Component samples consisting of the lipids 
dipalmitoylphosphatidylcholine (DPPC), dioleoylphosphatidylethanolamine 
(DOPE), dioleoyltrimethylammonia (DOTMA), and cholesterol are 
separately dissolved in ethanol to permit their individual introduction to 
5 mixing chamber "B" via input channels "A" shown in FIG. 1. DNA 

oligonucleotides (62-nucleotides in length) having the sequence: 

5'- ATCCAATCTCTCCACATCTCTATACTATCATCACC - 

N I N 2 N 3 N 4 N 5 N 6 N 7 -CTCACTCACTTCCATTCCAC -3' 

(SEQ ID NO:l) 

1 0 are prepared using solid phase synthesis. Synthesis of the variable base 

region N, through N 7 is programmed to provide a unique sequence for each 
oligonucleotide marker. The input channels "A" of FIG. 1 are programmed 
to increment each lipid channel in 10 mole% steps with respect to the other 
lipid channels which are also incremented in 10 mole% steps, taking into 

15 account the concentration of lipid in each solvent. In this example, the 

number of distinct oligonucleotide markers (using a variable string of seven 
bases) is 4 7 = 16,384, sufficient to uniquely mark each possible candidate 
formulation. 

Each oligonucleotide marker is associated with the discrete 
20 assemblies of its corresponding candidate formulation (volume = 100 (iL) in 

chamber "B" of FIG. 1, where the candidate formulations are successively 
and uniquely tagged. The marked candidate formulations are filtered by 
size to remove free oligonucleotide marker and large lipid particles (e.g., 
aggregates). After collection of the size-selected formulations in vessel "F" 
25 of FIG. 1, the volume of the solution is reduced by membrane filtration (300 

kilodalton cutoff) to provide a lipid concentration of 25 mg/mL. 

The collected, concentrated mixture of marked candidate 
formulations is injected into the lateral tail vein of CD 2 F, mice bearing 
PI 798 lymphosarcoma tumors that had been implanted intradermally in the 
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flanks of the mice 4 days prior to injection. An injection volume of 0.2 mL 
is used per mouse (each mouse approximately 20 grams weight) and a 
cohort of 10 mice is used. The mice are sacrificed 24 hours after injection 
and dissected. Blood, tumor, liver, spleen, kidney, heart, lungs, bone, 
5 intestines, brain, muscle and skin are separately collected. The tissues are 

immediately homogenized individually and a fraction enriched in the 
oligonucleotide marker is collected for each tissue. This fraction is used in 
a PCR amplification apparatus using primer sequences complementary to 
the 15 bases at the 5' and 3' ends of the encoding DNA whose sequence is 

10 provided above. The amplified DNA of each tissue is sequenced and 

decoded. Optimal formulations are identified by the extent of their 
accumulation in the tissues. From the oligonucleotide sequence, the optimal 
formulation may be identified by reference to the database containing 
information cross-referencing the identity of a particular oligonucleotide 

15 marker and the lipid formulation with which it was originally associated. 

EXAMPLE 2 

FORMULATION OPTIMIZATION FOR CELLULAR 

DELIVERY OF PLASMIDS USING ELECTROPHORIC MARKERS 

20 Distearoylphosphatidylethanolamine (DSPE) is coupled by 

condensation to the electrophone moiety shown in FIG. 2, and the product 
is purified by reverse phase chromatography. Two modified lipids are 
generated, where Ar=C 6 Ci 5 and n=3 in one case and n=4 in the other. A 
spray-dried powder is then produced consisting of 79.9% (by weight) 

25 distearoylphosphatidylcholine (DSPC), 20.0% cholesterol (by weight), and 

0.1% (by weight) of the n=3 modified DSPE. Hydration of 100 mg of the 
powder in 4.0 mL of water at 65°C is followed by sonication to yield 
unilamellar liposomes of 60 nm average diameter. A similar preparation is 
made using dioleoylphosphatidylethanolamine (DOPE)/6-aminomannose 
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cholesterol (1:1 mole ratio), with 0.1% n=4 marker. 

A quantity of 1.0 mg plasmid DNA is added and incorporated into 
1 .0 mL of each of the two candidate liposome formulations. The plasmid 
encodes for a protein that produces a fluorescent signal upon irradiation. 

5 The formulation samples are applied simultaneously to a suspension of 

PI 798 tumor cells in culture. The cells are incubated for 4 hours with the 
candidate liposome formulations and then sorted by fluorescence-activated 
cell sorting (FACS) using the fluorescent protein signal to gate the 
collection. The suspension of high fluorescence cells is centrifuged and the 

10 pellet is homogenized. The dried homogenate is irradiated at 365 nm to 

break the photolabile linker and release the electrophone marker, which is 
then analyzed by ECGC. The predominant signal is from the n=4 marker, 
indicating the superiority of liposome formulations comprising DOPE/6- 
aminomannose cholesterol over similar liposome formulations comprising 

15 DSPC/cholesterol for delivery of plasmids into cells. 



EXAMPLE 3 

MI n TTPT £ FORMULATION ENCODING 
USING ELECIRQBHQRIC MARKERS 
20 The procedure and materials of Example 1 are followed except that 

electrophoric moieties rather than DNA oligonucleotides are used as 
markers. 

The candidate formulations are tagged using fourteen programmed 
syringe pumps containing solutions of DSPE modified with electrophoric 
25 markers as in Example 2, comprising the series n=3 to n=12 for Ar=C 6 H 5 

and n=4 to n=7 for Ar=C 6 H 2 Cl 3 . The number of distinct electrophoric 
markers is therefore 2 ,4 = 16,384. The syringe pumps are programmed to 
deliver 0.1% by weight of the modified lipids to mark each of the candidate 
formulations. The formation of lipid carriers and animal experimentation 
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are carried out as in Example 1 . 

After tissue collection, the samples are homogenized and dried. The 
samples are subjected to irradiation at 365 nm and the released electrophone 
markers are analyzed by ECGC. The pattern of peaks is used to determine 
5 the formulation(s) most actively taken up in each tissue type. 

Although the present invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it 
will be readily apparent to those of ordinary skill in the art in light of the 
teachings of this invention that various changes and modifications may be 
1 0 made thereto without departing from the spirit or scope of the claims. In 

addition, changes in form and the substitution of equivalents are 
contemplated as circumstances may suggest or render expedient. Therefore, 
the foregoing description should not be construed to limit the scope of the 
present invention, which is set forth in the appended claims. 
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CLAIMS : 

1. A method for optimizing a multicomponent formulation comprising 
the steps of: 

(a) preparing a plurality of candidate formulations of known 
5 initial composition; 

(b) associating with each candidate formulation a marker of 
known identity, each marker being uniquely correlated to the 
candidate formulation with which it is associated; 

(c) evaluating the marked candidate formulations for at least one 
10 desired property to determine an optimal formulation; 

(d) identifying the marker which is associated with the optimal 
formulation; and 

(e) determining the initial composition of the optimal formulation 
using the marker identity obtained in (d). 

15 

2. The method of claim 1 wherein the plurality of candidate 
formulations comprises a number greater than 10. 

3. The method of claim 1 wherein said marker is selected from an 
20 oligonucleotide or an electrophone moity. 

4. The method of claim 3 wherein said oligonucleotide is selected from 
RNA or DNA. 



25 5. The method of claim 3 wherein the oligonucleotide is between 12 

and 100 nucleotides in length. 

6. The method of claim 3 wherein the electrophone marker is 
comprised of a haloarene moiety. 
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7. The method of claim 6 wherein said haloarene moiety is selected 
from the group consisting of C 6 C1 5 , C 6 H 2 C1 3 and C 6 H 2 C1 2 F. 

8. The method of claim 6 wherein said electrophone marker is 
5 comprised of a C 3 to C 12 alkyl chain and a haloarene moiety. 

9. The method of claim 8 wherein said electrophoric marker is attached 
to a lipid. 

10 10. The method of claim 9 wherein said lipid is present at a 

concentration of about 0.0001% to 10% by weight of the formulation. 

1 1 . The method of claim 9 wherein said lipid is present at a 
concentration of about 0.01% to 1% by weight of the formulation. 

15 

12. The method of claim 9 wherein said electrophoric marker is attached 
to said lipid via a linker. 

13 The method of claim 12 wherein said linker comprises a photolabile 
20 linkage. 

14. The method of claim 1 wherein one or more of the formulation 
components comprises an amphiphilic compound. 

25 15. The method of claim 14 wherein the amphiphilic compound is a 

lipid. 

16. The method of claim 15 wherein the lipid is 
distearoylphosphatidylethanolamine. 
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1 7. The method of claim 1 5 wherein the lipid is cationic. 

1 8. The method of claim 1 wherein one or more of the formulation 
components comprises a sterol or a derivative thereof 

5 

19. The method of claim 1 8 wherein the sterol is cholesterol. 

20. The method of claim 1 8 wherein the sterol derivative is 
aminomannose-derivatized cholesterol. 

10 

21. The method of claim 1 wherein the candidate formulations comprise 
a drug carrier. 

22. The method of claim 2 1 wherein the drug carrier is a liposome; 

15 

23. The method claim 1 whereby determination of the initial composition 
of the optimal formulation is accomplished by reference to a database which 
contains information cross-referencing the identity of the marker and the 
identity of its associated formulation. 

20 

24. The method of claim 1 wherein said plurality of candidate 
formulations are evaluated substantially simultaneously. 

25. A method for optimizing a multicomponent formulation comprising 
25 the steps of: 

(a) preparing a plurality of candidate formulations of known 
initial composition; 

(b) associating with each candidate formulation a marker of 
known sequence; 
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(c) administering the marked candidate formulations to a 
collection of live cells; 

(d) evaluating the marked candidate formulations for at least one 
desired property to determine an optimal formulation; and 

5 (e) determining the initial composition of the optimal 

formulation. 

26. The method of any one of claim 25 wherein the plurality of candidate 
formulations comprises a number greater than 10. 

10 

27. The method of claim 25 wherein said marker is an oligonucleotide. 

28. The method of claim 26 wherein said marked formulations are 
evaluated by amplifying said oligonucleotide marker associated with the 

15 optimal formulation to an extent sufficient to permit sequencing of the 

marker. 

29. The method of claim 28 wherein said oligonucleotide marker is 
amplified by a polymerase chain reaction. 

20 

30. The method of claim 25 wherein said marker is an electrophone 
moity of known chemical formula. 

3 1 . The method of claim 28 wherein said electrophone marker is 
25 evaluated using a gas chromatographic technique. 

32. The method of claim 29 wherein said gas chromatographic technique 
is capillary gas chromatography. 
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33. The method of claim 29 wherein said gas chromatographic technique 
is electron capture gas chromatography. 



34. The method of claim 30 wherein said electrophoric marker is 
5 evaluated using a mass spectrometric technique. 

35. The method of claim 25 wherein the cell collection comprises an in 
vitro cell culture. 

10 36. The method of claim 25 wherein the cell collection comprises a 

plant. 

37. The method of claim 25 wherein the cell collection comprises a 
human or non-human animal body. 

15 

38. The method of claim 25 wherein one or more of the formulation 
components comprises an amphiphilic compound. 

39. The method of claim 38 wherein the amphiphilic compound is a 
20 lipid. 

40. The method of claim 39 wherein the lipid is 
distearoylphosphatidylethanolamine. 

25 41. The method of claim 39 wherein the lipid is cationic. 

42. The method of claim 25 wherein one or more of the formulation 
components comprises a sterol or a derivative thereof. 
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43. The method of claim 42 wherein the sterol is cholesterol. 



44. The method of claim 43 wherein the sterol derivative is 
aminomannose-derivatized cholesterol. 

5 

45. The method of claims 25 wherein the candidate formulations 
comprise a drug carrier. 

46. The method of claim 45 wherein the drug carrier is a liposome. 

10 

47. The method claim 25 whereby determination of the initial 
composition of the optimal formulation is accomplished by reference to a 
database which contains information cross-referencing the identity of the 
marker and the identity of its associated formulation. 

15 

48. The method of claim 25 wherein means for administration to the 
animal body is selected from the group consisting of intravenous, 
intramuscular, subcutaneous, percutaneous, oral, ingestion and topical 
application. 

20 

49. The method of claim 25 wherein the one or more desired properties 
comprises targeted delivery of the formulation to a specific tissue within a 
human or non-human animal body. 

25 50. The method of claim 49 wherein the targeted tissue is a tumor. 

51. The method of claim 25 wherein said plurality of candidate 
formulations are evaluated substantially simultaneously. 
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